Semi-Supervised Discriminant Analysis Based On Data Structure

نویسندگان

  • Xuesong Yin
  • Rongrong Jiang
  • Lifei Jiang
چکیده

Dimensionality reduction is a key data-analytic technique for mining high-dimensional data. In this paper, we consider a general problem of learning from pairwise constraints in the form of must-link and cannotlink. As one kind of side information, the must-link constraints imply that a pair of instances belongs to the same class, while the cannot-link constraints compel them to be different classes. Given must-link and cannot-link information, the goal of this paper is learn a smooth and discriminative subspace. Specifically, in order to generate such a subspace, we use pairwise constraints to present an optimization problem, in which a least squares formulation that integrates both global and local structures is considered as a regularization term for dimensionality reduction. Experimental results on benchmark data sets show the effectiveness of the proposed algorithm.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fast semi-supervised discriminant analysis for binary classification of large data-sets

High-dimensional data requires scalable algorithms. We propose and analyze three scalable and related algorithms for semi-supervised discriminant analysis (SDA). These methods are based on Krylov subspace methods which exploit the data sparsity and the shift-invariance of Krylov subspaces. In addition, the problem definition was improved by adding centralization to the semi-supervised setting. ...

متن کامل

Semi-supervised Neighborhood Preserving Discriminant Embedding: A Semi-supervised Subspace Learning Algorithm

Over the last decade, supervised and unsupervised subspace learning methods, such as LDA and NPE, have been applied for face recognition. In real life applications, besides unlabeled image data, prior knowledge in the form of labeled data is also available, and can be incorporated in subspace learning algorithm resulting in improved performance. In this paper, we propose a subspace learning met...

متن کامل

Feature reduction of hyperspectral images: Discriminant analysis and the first principal component

When the number of training samples is limited, feature reduction plays an important role in classification of hyperspectral images. In this paper, we propose a supervised feature extraction method based on discriminant analysis (DA) which uses the first principal component (PC1) to weight the scatter matrices. The proposed method, called DA-PC1, copes with the small sample size problem and has...

متن کامل

Self-Supervised Learning for Object Recognition based on Kernel Discriminant-EM Algorithm

In Proc. of IEEE Int’l Conf. on Computer Vision, Vancouver, Canada, 2001 It is often tedious and expensive to label large training data sets for learning-based object recognition systems. This problem could be alleviated by selfsupervised learning techniques, which take a hybrid of labeled and unlabeled training data to learn classifiers. Discriminant-EM (D-EM) proposed a framework for such tas...

متن کامل

Semi-supervised learning in MCI-to-ad conversion prediction - When is unlabeled data useful?

This paper investigates the use of semi-supervised learning (SSL) for predicting Alzheimers Disease (AD) conversion in Mild Cognitive Impairment (MCI) patients based on Magnetic Resonance Imaging (MRI). SSL methods differ from standard supervised learning methods in that they make use of unlabeled data in this case data from MCI subjects whose final diagnosis is not yet known. We compare two wi...

متن کامل

Towards Self-Exploring Discriminating Features for Visual Learning

Many visual learning tasks are usually confronted by some common difficulties. One of them is the lack of supervised information, due to the fact that labeling could be tedious, expensive or even impossible. Another difficulty is the high dimensionality of the visual data. Fortunately, these difficulties could be alleviated by using a hybrid of labeled and unlabeled training data for learning. ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015